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The present invention provides a process for determining the quantitative and 
qualitative profile of the repertoire of a given type of an immunoglobulin (Ig) 
heavy chain expressed by a lymphocyte B population present in a tissue sample, 
and kits and uses thereof. It also provides a set of VH forward primers 
5 associated with a CH reverse primer, which are respectively capable of 

specifically hybridizing in stringent conditions with the nucleic acids encoding 
the variable segments of lg heavy chains and wiLh the constant segment of a 
given type of Ig heavy chain, such as preferably IgM, IgG, IgE or TgA heavy 
chain. Methods for the in vitro diagnosis of a condition associated with an 
1 0 abnormal expression of the repertoire of a given type of Ig heavy chain, and for 

the in vitro follow-up of a treatment of such a condition, are also comprised 
herein. 



An essential feature of the immune system is the capacity to recognize 
15 specifically a large number of antigens. In vertebrates, the B lymphocyLes partly 

execute this function of recognition, by means of the immunoglobulins. 

To the tremendous variety of antigens to be recognized, there corresponds a 
very wide potential diversity of these immunoglobulins. Indeed, 

20 immunoglobulins are composed of four peptide chains, the NH2 terminal 

domains of which are highly variable. It is the existence of a strong interaction 
between a given antigenic determinant and the site consisting of these variable 
domains which constitutes the expression at molecular level of the phenomenon 
of recognition. Thus, the information contained in the genome of a mouse 

25 enables it lo produce potentially at least 10 n immunoglobulins of different 

variable region* 

The notion of repertoire thus emerges: the set of immunoglobulin variable 
regions present at a given instant in an organism for a given type of 
30 immunoglobulin constitutes the current repertoire of immunoglobulins of said 

given type. 
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The diversity of B cell repertoire is strictly correlated with the diversity of the 
antibodies Ihey express and produce. Antibody diversity is achieved in multiple 
ways. The rearrangement of V gene segments together with D and J segments 
5 for the heavy chain and V genes and J segments for the light chain allows 

combinational diversity by association of different sets of genes. Die two 
reeombinase activating genes RAG1 and RAG2 are responsible for the V(D)J 
recombination process (Schatz and al. (1989). Cell 59, 1035-48; Oetlinger and 
ah (1990). Science 248, 1517-23). Imprecise junctions of those gene segments, 
10 either by nibbling or by random addition of nucleotides by the Tdt enzyme 

increase again the diversity level (Boilum, F. J. (1978). Adv Enzymol Relat 
Areas Mol Biol 47, 347-74; Komori and al.. (1993). Science 261, 1171-5; 
Gilfillan and ah (1993). Science 26), 1175-8). Antigen-driven affinity 
maturation introducing somatic hypermutations (SHM) in the V region is 
15 another step in generating diversity, as well as the process of heavy and light 

chain pairing (Wu and al. (2003). J Clin Immunol 23, 235-46). Class switch 
recombination (CSR), resulting in the production of several antibody isotypes, 
increases the diversity and functionality of the D ceil repertoire, allowing one 
given variable region to be associated with different constant regions (Honjo 
20 and al. (2002) Anna Rev Immunol 20, 165-96). Finally, in some species- 

antibody diversification is mainly achieved by gene conversion (Thompson and 
Neiman (1987). Cell 48, 369-78; Reynaud and al. (1987). Cell 48, 379-88). 
Recently, it has been demonstrated that the three processes, SHM, CSR and 
gene conversion, require a single enzyme, the activation-induced cytidin 
25 desaminasc (AID) (Muramatsu and al. (2000). Cell 102, 553-63; Revy and al. 

(2000). Ceil 102, 565-75; Arakawa and al. (2002). Science 295, 1301-6; 
Okazaki and al. (2002). Nature 416, 340-5; Yoshikawa and al. (2002). Science 
296, 2033-6). While V(D)J recombination and Tdt activity contribute to 
diversity generation of both T and B cell repertoires (Tuaillon and Capra (2000). 
30 J Immunol 164, 6387-97; Cabaniols and al. (200)). J Exp Med 194, 1385-90), 

SHM, CSR and gene conversion are B cell specific mechanisms, lliese 
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additional mechanisms could eventually account for an increased diversity of 
the B cell repertoire as compared with that off cells. 

It represents a monumental task to describe the collective antibodies (Ab or 
5 immunoglobulins Ig) expressed at a given moment in a subject, since the 

immunoglobulin repertoire probably contains millions of different molecules. 
Only a small number of reagents capable of specifically recognizing elements of 
such a repertoire is as yet available. It is, of course, possible to work more finely 
by determining the sequence of a certain number of expressed genes. However, 
10 practical considerations make it scarcely conceivable to analyze routinely more 

than about ten or, perhaps, a hundred genes, and the operation is expensive and 
very lengthy. In short, the repertoire of immunoglobulins is described at the 
present time only by means of a small number of parameters. 

15 Hence methods are not available which permit a rapid and effective analysis of 

the physiological and pathological situations associated with the statement of 
these repertoires, for example, it is dear thai these repertoires vary during a 
voluntary immunization (vaccine), or during infection by pathogenic 
microorganisms, or during the progress of autoimmune pathologies, in the latter 

20 case, there are many reasons to believe that predisposition to these diseases 

mirrors a certain composition of the repertoires. It is hence very probable that a 
good method of analysis of the repertoires would have medical spin-oil's, and 
could form the basis of techniques of medical analysis and of diagnosis. 

25 Human T cell repertoire diversity has been analysed using different approaches 

(sec the european patents granted under the numbers EP 566685 and £P 
672184). Limiting dilutions analyses have shown that CD4 T cells from healthy 
individuals display a highly diverse TCR-0 repertoire (Wagner and ah (1998). 
Proc Natl Acad Sci U S A 95, 14447-52). Baaed on Immunoscope technologies, 

30 the inventors have estimated the lower limit of the total otP TCR diversity in 

humans to be 25xl0 6 different TCRs (Arstila and al. (1999). Science 286 y 958- 
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61]- Furthermore, the overall diversity of TCR-Ii memory CD8 T eell repertoire 
was estimated Lo be 5.10 4 to 10 s clonotypes (Arslila and al. (1999). Science 286, 
958-61; Baron and al. (2003). Immunity 18, 193-204). On the contrary, little is 
known about the size and diversity of the peripheral blood human B cell 
5 repertoire. Most of the studies on B cell repertoire have been performed cither in 

pathological situations or following repeated immunizations, preventing any 
extrapolation on the global size and diversity under physiological conditions. 

The contribution or somatic mutations to B cell repertoire diversity has been 

10 widely studied, but most studies have been carried out in murine models, 

determining the proportion of a mutated VH gene by a compararison with its 
germ line sequence, as first shown by Gojobori. et al. (1986, Mol Biol Evol 3, 
156-67). The conclusions of those studies was that peripheral blood 
lymphocytes from most mice are unmutatcd (less than 5%) (Schittek and 

15 Rajewsky (1992). J Exp Med 176, 427-38) and that most mutated B 

lymphocytes are localized in germinal centers (Berek and al. (1991). Cell 67 \ 
1 121-9; MacLcnnan and Gray (1986). Immunol Rev 91, 61-85). A more recent 
study has focused on the contribution of somatic mutations to the diversity of 
murine scrum immunoglobulins (Williams and al. (2000). Immunity 13, 409- 

20 1 7). Mutations do not seem to be involved in the diversity of serum lgM, IgG 

and TgA in young mice, but they accumulate with age in response to 
environmental antigens and in much higher proportions within IgG than in lgM. 
(Williams and al. (2000). Immunity 13 9 409-17). In humans, the proportion of 
peripheral blood B lymphocytes bearing somatic mutations is much higher 

25 compared to mice, reaching up to 40% of all peripheral B lymphocytes, and they 

were shown to express the CD27 marker. (Klein and al. (1997). Blood 89, 1288- 
98; Klein and al. (1998). J Exp Med 188, 1679-89). However, these studies did 
not bring information on the global B cell clone size and on the proportion of 
clonal expansion within a non antigen selected rearrangement. 



30 
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Similarly, two studies have been conducted for the immunoglobulins, namely 
Loembe and al, Eur J Immunol. 2002 Dec;32(12):3678-88, and Fais and al, J 
Clin Tnvest. 1998 Oct 15; 1 02(8): 15 15-25. 

5 The immunoscopc method, based on PGR techniques, allows the quantitative 

study of the lymphocytes T repertoires by determining the length of the CDR3 
regions and TCR, quantifying the use of the variable segments and optionally 
elucidating their complete sequences. The use of these methodologies for 
studying the human immunoglobulin repertoires and the possibility to quantify 
1 0 the use of the variable segments in the formation of the immunoglobulin heavy 

chains, although often mentioned, have never realized. 

The object of the present invention is thus to provide the tools which are 
necessary for enabling the quantitative study of the lymphocyte B repertoire in 

15 mammals. These studies can be conducted in physiological conditions, for 

example within the context of the follow-up of a vaccine whose protective 
power comes from the production of specific antibodies, or in pathological 
conditions, tor example when an auto-immunc disease arises with the 
production of autoantibodies. Thus, the new application range described herein 

20 allows the follow-up of pathologies which arc specifically derived from the 

immunoglobulin expression by the B lymphocytes and therefore distinct from 
that which are specifically derived from the T lymphocyte expression. 

Thanks to their sensibility, the process of the present invention allows to 
25 determine the quantitative and qualitative profile of the repertory of a given type 

of an inununoglobulin heavy chain expressed by a lymphocyte B population, 
thereby determining the specific clones of lymphocyte D and the nucleic acid 
sequences of the genes encoding the immunoglobulins. This determination 
allows to follow more individually each of the cellular clones in various 
30 samplings of a given subject. 
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The present invention provides advantages over the prior art: firstly, the 
obtention of quantitative results doesn't require DNA sequencing, unlike the 
"unique cell PCR technique^, which is much more difficult to. perform and 
which is only available with populations previously sorted. Secondly, the results 
5 can be deduced from a large number of cells and thus are more significative in a 

statistical point of vue, always in comparison with the "unique cell PCR 
technique". Thirdly, because no antibody specific lor the VH subgroups exists, 
the FACS quantification technique used for studying the specific lymphocyte T 
VP subgroups is not applicable to the 13 lymphocytes. And finally, unlike the 

10 BUS A technique which allows to describe at the proteic level the specificity 

and the type of the circulating antibodies, the B immunoscope technology 
allows to quantitatively analyse at the mRNA level the repertoire of the 
intracytoplasmic immunoglobulins or of the immunoglobulins expressed at the 
surface of the B lymphocytes., thereby providing earlier and more detailed 

1 5 informations of the activity state of the genes concerned. 

As a consequence, the present invention provides the obtention of quantitative 
and qualitative informations directly from the lymphocyte B repertoire without 
any sequencing, it provides a rapid method, and gives the possibility to 
20 distinguish and quantify the VH repertoire specific of each immunoglobulin 

type individually or as a whole. 

Thus, a first object of the present invention is to provide a process for 
determining the quantitative and qualitative profile of the repertoire of a given 
25 type of an immunoglobulin heavy chain expressed by a lymphocyte B 

population present in a tissue sample, characterized in that iL comprises the 
following steps: 

(a) obtaining either the cDNA from ihc mRNA expressed from the tissue 
sample or the cellular DNA extract of the tissue sample, 
30 (b) performing the amplification of the cDNA obtained at the step (a) with a 

set of VH forward primers capable of specifically hybridizing in 
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stringent conditions with the nucleic acids encoding the variable 
segments (VH) of immunoglobulin heavy chains, said variable segments 
being distributed among VH subgroups, associated with a CH reverse 
primer, or a mixture thereof, capable of specifically hybridizing in 
5 stringent conditions with the nucleic acid encoding the constant segment 

(CH) of a given type of an immunoglobulin heavy chain, and 
(c) determining the quantitative and qualitative profile of the repertoire of 
said type of immunoglobulin heavy chain for each VH subgroup. 

The terms of antibody and immunoglobulin, which arc indifferently used herein, 
refer to the association of two heavy chains and two light chains. The pan of the 
antibody which is specific for an antigen is constituted by the rearrangement of 
three gene segments V D and J for the heavy chains and V and J for the light 
chains. The variable segments VH of the heavy chains arc classified in VH 
subgroups relative to the sequences of the nucleic acids which encode them. In 
humans, the VH subgroups are at least the VIII, VII2, VIDa, VII3b ? VII4, 
VH5, VH6 and VII7 subgroups. Accordingly, the JII subgroups are at least the 
JH1, JH2, JH3 ? JH4, JH5 and JH6 subgroups. 

The quantitative and qualitative profile of the repertoire of a given type of an 
immunoglobulin heavy chain refers in the present invention to the profile 
corresponding at once to the relative use of the nucleic acids encoding the heavy 
chains of immunoglobulins of a given type, in particular the nucleic acids 
encoding the variable segments (VH) of immunoglobulin heavy chains of a 
given type, expressed by a lymphocyte B population and to the length of the 
VDJ rearrangements for each VH or JH subgroup. 

The repertoire of a given type of an immunoglobulin heavy chain refers to the 
immunoglobulin heavy chains of a given type which are expressed by a 
30 lymphocyte B population present in a tissue sample. 
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The type, or class, of an immunoglobulin heavy chain may be any type which is 
sufficiently described in the literature, such as the TgM type, the TgG type, in 
particular the IgGl type, the IgG2 type, the TgG3 type, or the IgG4 type, Ihe IgE 
type, the IgA type or the IgD type. Thus, for example, the quantitative and 
5 qualitative profile will be determined for the TgM type with the process 

according to the present invention. 

Indeed, the lymphocytes B are subjected to the isotypic commutation: for the 
same variable segment, an immunoglobulin can belong to a given type and su 
10 have different properties relative to its type. The IgM are the most present 

immunoglobulins in the scrum, the lgG correspond generally to an anamnestic 
reaction, the IgA arc mostly expressed by the gut mucosa and the IgE are 
characteristic for an allergic response. 

15 The size of a B lymphocyte clone may vary in an important manner in the 

course of time or after an immune response. The process for determining the 
quantitative and qualitative profile of the present invention allows to detect 
directly, without inevitably sequencing them, the presence of clonal expansions 
in a given rearrangement, 

20 

The tissue sample comprising the lymphocyte B population is usually the 
lymphocyte B population present in the blood of a subject, but it be may also 
tissue sample of the lymphoid system, such as For example lymph nodes or 
tonsil. The process for determining the quantitative and qualitative profile of the 

25 present invention doesn't necessarily require to previously purify the 

lymphocyte B population from the tissue sample, thus allowing to greatly 
facilitate its implemantation. Accordingly, the process may be performed from a 
heterogeneous cellular population, but also from a previously purified 
lymphocyte B population, or from a lymphocyte B subpopulation, such as for 

30 example naive B lymphocytes versus the memory B cells or according to the 

antigenic specificity antibodies they express, and therefore performing previous 
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classical purification techniques such as magnetic sorting or any cellular sorting 
technique. 

The cellular nucleic acid content of the tissue sample is then obtained from the 
5 tissue sample using conventional techniques. When this nucleic acid content is 

mRNA, the cDNA synthesis performed thanks to the reverse transcription 
reaction is well known, as well as the method allowing to only transcribe the 
mRNA present in the total RNA of the tissue sample thanks to a poly(T)primcr 
(see Example 1: U RNA and cDNA preparation'*). The nucleic acid content may 
1 0 also be Lhe DNA extract of the tissue sample (see HP 566685). 

Methods of DNA engineering are sufficiently described in the literature, and are 
well known by the man skilled in the art, who will know how to use them in the 
most convenient manner in the process of the invention (see Sambrook et aL 

15 (1989), Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

Laboratory Press, Cold Spring Harbor, N.Y.; by Ausubcl ct al. (1994), eds 
Current Protocols in Molecular Biology, Current Protocols Press; and by Berger 
and Kimmel (1987), Methods in Enzymology: Guide to Molecular Cloning 
Techniques, Vol 1 52, Academic Press, Inc., San Diego, Calif, the disclosures of 

20 which arc hereby incorporated by reference). 

Then the amplification is performed using a PCR type technique, that is to say 
the PCR technique or any other apparented technique: two primers (VH and 
CH), respectively complementary to the variable segment and to the constant 
25 segment of a given type of Ig heavy chain are then added to the nucleic acid 

content (cONA or DNA) along with a polymerase such as Taq polymerase, and 
the polymerase amplifies the cDNA region between the VH and CH primers. 

The expression "specifically hybridizing in stringent conditons" refers in the 
30 present invention to a hybridizing step in the process of the invention where the 

oligonucleotide sequences selected as probes or primers are of adequate length 
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and sufficiently unambiguous so as to minimize the amount of non-specific 
binding lhat may occur during the amplification. 

Hybridization is typically accomplished by annealing the oligonucleotide probe 
5 or primer to the cDNA (or DNA) under conditions of stringency that prevent 

non-specific binding but permit binding of this cDNA which has a significant 
level of homology with the probe or primer. 

Among the conditions of stringency is the melting temperature Tm for the 
10 amplification step using cither the set of VH forward primers associated with the 

CH primer, or the VH internal forward primer associated with the set of JH 
reverse primers, which is in the range of about 58°C to about 60°C ; preferably, 
the Tm for the amplification step is in the range of about 58°C or about 60°C. 
Most preferably, the Tm for the amplification step is about 60°C. 

15 

Typical hybridization and washing stringency conditions depend in part on the 
size (i.e., number of nucleotides in length) of ihe cDNA or oligonucleotide 
probe (Ausubel and al., 1994, eds Current Protocols in Molecular Biology). 



20 The oligonucleotide probes or primers herein described may be prepared by any 

suitable methods such as chemical synthesis methods (see references supra for 
methods of DNA engineering). 

Preferably, the process for determining the quantitative and qualitative profile 
25 according to the present invention is characterized in that separated 

amplifications are performed for each of the VH subgroups. 



30 



Indeed, the nucleic acid content (cDNA or DNA) obtained at the step (a) from 
the tissue sample is divided such as to perform so much separated amplifications 
as there are couples of VH and CH primers. 
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By the expression mixture of CH reverse primer it is intended to refer in the 
present invention to a mixture of at least two, preferably at least 2, 3, 4 or at 
least 5 CH reverse primers, said CH reverse primers being preferably present in 
an equivalent quantity between each other. For example, when there are 2 CIT 
5 reverse primers in the mixture, each CH reverse primer is present at 50 % in said 

mixture. 

In a preferred embodiment, the process for determining the quantitative and 
qualitative profile according to the present invention is characterized in that the 
separated amplifications are real-time separated amplifications, said real-time 
amplifications being performed using a CH labeled reverse probe, prfcrably a 
CH labeled reverse hydrolysis-probe, capable of specifically hybridizing in 
stringent conditions with the constant segment of the given type of 
immunoglobulin heavy chain and capable of emiting a detectable signal 
every time each amplification cycle occurs, and characterized in that the signal 
obtained for each VII subgroup is measured. 

Hie real-time amplifi cation, such as real-time PCR, is well known by the man 
skilled in the art, and the various known techniques will be employed in the best 
way for the implementation of the present process. These techniques are 
performed using various categories of probes, such as hydrolysis probes 
(TaqMan, Applied Biosystems, USA), hybridization adjacent probes, or 
molecular beacons. Hydrolysis probes are preferred. The techniques employing 
hydrolysis probes or molecular beacons are based on the use of a fluorescence 
quencher/reporter system, and the hybridization adjacent probes are based on 
the use of fluorescence acceptor/donor molecules. 

Hydrolysis pTobes with a fluorescence quencher/reporter system are available in 
the market, and are for example commercialized by the Applied Biosytcms 
30 group (USA). Many fluorescent dyes may be employed, such as FAM dyes (6- 

carboxy-fluorescein), or any other dye phosphoramidite reagents. In the present 
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invention, the inventors have employed a TaqMan Minor Groove Binder (MGB) 
FAM-iabeled probe (sec Example 1), but other probes may be used, in a 
convenient manner. 

5 Among the stringent conditions applied for anyone of the hydrolysis-probes of 

the present invention is the Tm, which is in the range of about 68°C to 70°C ; 
preferably, the Tm for anyone of the hydrolysis-probes of the present invention 
is in the range of about 69°C to about 70°C. Most preferably, the Tm applied for 
anyone of the hydrolysis-probes of the present invention is about 70°C. 

10 

Tn another preferred embodiment, the process for determining the quantitative 
and qualitative profile according to the present invention is characterized in that 
the separated amplification products obtained for each of the VH subgroups are 
further elongated using a CH labeled reverse probe capable of specifically 

1 5 hybridizing in stringent conditions with the constant segment of the given type 

of immunoglobulin heavy chain and capable of emiting a detectable signal, and 
characterized in that the elongation products are separated, for each of the VH 
subgroups, relative to their length, the signal obtained for the separated 
elongation products is measured, and the quantitative and qualitative profile of 

20 the labeling intensity relative to the elongation product length is established, for 

each of the VH subgroups individually. 

This elongation step, also named "run-off reaction", allows to determine the 
length of the VDJ reanrangemcnts fro each VII subgroup. The separated 

25 amplification products obtained for each VH subgroup are elongated using a 

DNA polymerase and a CH labeled reverse probe capable of specifically 
hybridizing in stringent conditions with the constant segment of the given type 
of immunoglobulin heavy chain and capable of emiting a detectable signal. 
Elongation products are thus obtained, which are labeled at their CH end and 

30 their length can be determined. 
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The determination of ihe length can be realized using conventional techniques 
for the determination of DNA sequences, with gels such as polyacryjamide gels 
for the separation, DNA sequencers and adapted softwares. The labeling of the 
Cll labeled reverse probe may be any appropriate labeling, such as for example 
5 radio-labeling or preferably lluoresccnt-labeling. The "run-off reaction" is well 

described in EP 566 685. 

Among the stringent conditions applied for anyone of the labeled probes used 
for the elongation ("Tun-ofiT reaction) is the Tm, which is in the range of about 
10 58°C to about 60°C ; preferably, the Tm for (he amplification step is in the 

range of about 58°C or about 60°C Most preferably, the Tm for the 
amplification step is about 60°C. 

Preferably, the process for determining the quantitative and qualitative profile 
according to the present invention is characterized in that the set of VH forward 
primers comprises at least the 8 following subgroups of VH primers 
corresponding to the VH subgroups : 

- the VH1 primers having the sequences SEQ ID N° 1 to SEQ ID N c 3, and 

- the VH2 primer having the sequence SEQ ID N° 4, and 

- the VH3a primers having the sequences SEQ ID N° 5 and SEQ ID N° 6, and 

- the VH3b primers having the sequences SEQ ID N° 7 to SEQ ID N v 10, and 

- the VH4 primers having the sequences SEQ ID N° 11 and SEQ ID 1ST 12, 
and 

- the VH5 primer having the sequence SEQ TD N° 1 3, and 

- the VH6 primer having the sequence SEQ ID N° 14, and 

- the VH7 primer having the sequence SEQ ID N° 15. 

More preferably, the process for determining the quantitative and qualitative 
profile according to the present invention is characterized in that the sequences 
30 SEQ ID N° 1 to SEQ ID N° 15 may contain at least one to three point 

mutations, except for the nucleotides 1 to 6 of their 3* part. 
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The point mutation refers in the present invention to a mutation which occurs 
for only one nucleotide, on the contrary of a mutations which occurs for an 
oligonucleotide sequence. The point mutation may be a substitution, a deletion 
5 or an addition. 

The process for determining the quantitative and qualitative profile according to 
the present invention may further be characterized in that the CH reverse primer 
is selected from the CI I reverse primers capable of specifically hybridizing in 
10 stringent conditions with the nucleic acids encoding the constant segments (CH) 

of the TgM heavy chain, the lgE heavy chain and the IgA heavy chain. 

Preferably, the process for determining the quantitative and qualitative profile 
according to the present invention is characterized in that, when the CH reverse 
15 primer is capable of specifically hybridizing in stringent conditions with the 

nucleic acid encoding the constant segment (CH) of the IgM heavy chain, the 
CH reverse primer has the sequence SEQ ID NT 26, or the sequence SKQ ID N D 
26 wherein one to three point mutations may occur, except for the nucleotides 1 
to 6 of its 3^ part. 

20 

In another preferred embodiment, the process for determining the quantitative 
and qualitative profile according to the present invention is characterized in that, 
when the CH reverse primer is capable of specifically hybridizing in stringent 
conditions with the nucleic acid encoding the constant segment (CH) of the igfc 
25 heavy chain, the CH reverse primer has the sequence SEQ ID N° 33, or the 

sequence SEQ ID N° 33 wherein one to three point mutations may occur, except 
for the nucleotides 1 to 6 of its 3' part. 



30 



In another prefen-ed embodiment, the process for determining the quantitative 
and qualitative profile according to the present invention is characterized in that, 
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when the given type of immunoglobulin heavy chain is the IgG type, a mixture 
of two CH reverse primers is associated with the set of VH forward primers, 
said two CH reverse primers having the sequences SEQ ID N° 27 and SEQ ID 
N° 28, or the sequences SEQ ID N° 27 and SEQ ID N° 28 wherein one to three 
5 point mutations may occur, except for the nucleotides 1 to 6 of its 3' part. 

Preferably, the process for determining the quantitative and qualitative profile 
according to the present invention may further be characterized in that the CH 
reverse primer is selected from the CH reverse primers capable of specifically 
hybridizing in stringent conditions with the nucleic acids encoding the constant 
segments (CH) of the IgGl heavy chain, the lgG2 heavy chain, the TgG3 heavy 
chain or the TgG4 heavy chain. 

furthermore, the process for determining the quantitative and qualitative profile 
1 5 according to the present invention is characterized in that, when the given type 

of immunoglobulin heavy chain is an TgM heavy chain and when the separated 
amplifications arc real-time separated amplifications, the CH labeled hydrolysis- 
probe has the sequence SEQ TD N° 29, or the sequence SEQ ID N° 29 wherein 
at least one point mutation may occur. 

20 

Furthermore, the process for determining the quantitative and qualitative profile 
according to the present invention is characterized in that, when the given type 
of immunoglobulin heavy chain is an lgM heavy chain and when the separated 
amplification products obtained for each of the VH subgroups are further 
25 elongated, the CH labeled reverse probe has the sequence SEQ TD N° 30, or the 

sequence SEQ IDN° 30 wherein one to three point mutations may occur, except 
foT the nucleotides 1 to 6 of its 3' part. 

Preferably, the process for determining the quantitative and qualitative profile 
30 according to the present invention is characterized in that, when the given type 

of immunoglobulin heavy chain is an TgE heavy chain and when the separated 
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amplifications are real-time separated amplifications, die CH labeled hydrolysis- 
probe has the sequence SEQ ID N° 36, or the sequence SEQ ID N° 36 wherein 
at least one point mutation may occur. 



5 Furthermore, the process for determining the quantitative and qualitative profile 

according to the present invention is characterized in that, when the given type 
of immunoglobulin heavy chain is an IgE heavy chain and when the separated 
amplification products obtained for each of the VII subgroups are further 
elongated, the CH labeled reverse probe has the sequence SEQ ID N° 37 ? or the 
10 sequence SEQ ID N° 37 wherein one to tlirce point mutations may occur, except 

for the nucleotides 1 to 6 of its 3' part. 



Preferably, the process for determining the quantitative and qualitative profile 
according to the present invention is characterized in that, when the given type 
1 5 of immunoglobulin heavy chain is an JgG heavy chain and when the separated 

amplifications are real-time separated amplifications, the CH labeled hydrolysis- 
probe has the sequence SEQ ID N* 34, or the sequence SEQ ID N° 34 wherein 
at least one point mutation may occur. 

20 Furthermore, the process for determining the quantitative and qualitative profile 

according to the present invention is characterized in that, when the given type 
of immunoglobulin heavy chain is an IgG heavy chain and when the separated 
amplification products obtained for each of the VH subgroups are further 
elongated, the CH labeled reverse probe has the sequence SEQ ID N° 35, or the 

25 sequence SEQ ID N° 35 wherein one to three point mutations may occur, except 

for the nucleotides 1 to 6 of its 3' part. 



30 



Preferably, the CH labeled hydrolysis-probes having the sequences SEQ ID N° 
29, SEQ ID N° 34 and SEQ TD N° 36, may have 2 point mutations in their 
sequences. 
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The point mutations which may occur in the sequences of these CH labeled 
hydrolysis-probes may be any point mutations provided that this sequence is of 
adequate length and sufficiently unambiguous so as to minimize the amount of 
non-specific binding that may occur. 

5 

In one further embodiment, the process for determining the quantitative and 
qualitative profile according to the present invention is characterized in that 
further separated amplifications for each of JH subgroups are performed from 
the separated amplification products obtained for at least one given VH 

10 subgroup of the VII subgroups with the CH reverse primer, 

said further separated amplifications being performed using a VH internal 
forward primer corresponding to the given VH subgroup, and associated with a 
set of JH reverse primers corresponding to the JH subgroups and capable of 
specifically hybridizing in stringent conditions with the nucleic acids encoding 

15 the j unction segments of the given type of immunoglo bulin heavy chain. 

The stringent conditions applied for the further separated amplifications using 
the VII internal forward primer associated with the set of JH reverse primers are 
the same as that applied for the amplification step using the set of VH forward 
20 primers associated with the CH primer (see supra). Most preferably, the Tm for 

the amplification step using the VH internal forward primer associated with the 
set of JH reverse primers is about 60 Q C. 

Preferably, the process for determining the quantitative and qualitative profile 
25 according to the present invention is characterized in that the further separated 

amplifications arc real- time amplifications performed using a VH labeled 
forward probe, preferably a VH labeled forward hydrolysis-probe, capable of 
specifically hybridizing in stringent conditions with the variable segment of the 
given type of immunoglobulin heavy chain and capable of emiting a detectable 
30 signal everylime each amplification cycle occurs, and characterized in that the 

signal obtained for each JH subgroup is measured. 
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Among the stringent conditions applied for the VH labeled forward hydrolysis- 
probe capable of specifically hybridizing with the variable segment of the given 
type of immunoglobulin heavy chain is the Tm ? which is in the range of about 
5 68°C to about 70°C ; preferably, the Tm for the VH labeled forward hydrolysis- 

probe is in the range of about 69°C to alxmt 70°C ; Most preferably, the Tin 
applied for the VTT labeled forward hydrolysis-probe is about 70 V C. 

More preferably, the process for determining the quantitative and qualitative 
10 profile according to the present invention is characterized in that, when the 

given VH subgroup is the VII5 subgroup., the VH5 internal forward primer has 
the sequence SEQ ID N° 31, or the sequence SEQ ID N° 31 wherein one to 
three point mutations may occur, except for the nucleotides 1 to 6 of its 3' part. 

15 More preferably, the process for determining the quantitative and qualitative 

profile according to the present invention is characterized in that the VTT labeled 
forward hydrolysis-probe has the sequence SEQ ID N° 32 ? or the sequence SEQ 
ID N° 32 wherein at least one point mutation may occur. 

20 Preferably, the VH labeled forward hydrolysis-probe having the sequence SEQ 

ID N° 32 may have 2 point mutations in its sequence. 

The point mutations which may occur in the sequences of this VII labeled 
forward hydrolysis-probe may be any point mutations provided that this 
25 sequence is of adequate length and sufficiently unambiguous so as to minimize 

the amount of non-specific binding that may occur. 

In one further embodiment, the process for determining the quantitative and 
qualitative profile according to the present invention is characterized in that 
30 separated elongations are performed for each of the JII subgroups from the 
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separated amplification products obtained for at least one given VH subgroup of 
the VH subgroups with the CII reverse primer. 

said further separated elongations being performed using a set of JH labeled 
reverse primers corresponding to JH subgroups and capable of specifically 
5 hybridizing in stringent conditions with the nucleic acids encoding the junction 

segments oT the given type of immunoglobulin heavy chain, said JH labeled 
reverse primers being capable ofemiting a detectable signal, 
and characterized in that the elongation products are separated, Tor each of the 
Jll subgroups, relative to their length, the signal obtained for the separated 
10 elongation products is measured, and the quantitative and qualitative profile of 

the labeling intensity relative to the elongation product length is established, for 
each of the JTT subgroups for the given VTI subgroup. 



The stringent conditions applied for the set of JH labeled reverse primers used 
15 for the elongation and capable of specifically hybridizing with the junction 

segment of the given type of immunoglobulin heavy chain is the Tm, which is in 
the range of about 58 and about 60 °C, preferably, about 59 °C, most preferably, 
about 60 °C 



20 Preferably, the process for determining the quantitative and qualitative profile 

according to the present invention is characterized in that the set of JH forward 
primers, optionally labeled, comprises at least the 6 following subgroups of JH 
primers corresponding to the JH subgroups : 

the Jill primer having the sequence SEQ ID N° 1 6, and 

25 - the JH2 primer having the sequence SEQ ID N° 1 7, and 

- the JH3 primer having the sequence SEQ ID N° 1 8, and 

- the JH4 primers having the sequences SEQ ID N° 1 9 to SEQ TD N° 21, and 

- the JH5 primer having the sequence SEQ ID N° 22, and 

- the JH6 primers having the sequences SEQ ID N° 23 to SEQ ID N° 25. 



30 
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More preferably, the process for determining the quantitative and qualitative 
profile according to the present invention is characterized in thai the sequences 
SEQ ID N° 16 to SEQ ID N° 25 may contain at least one to three point 
mutations, except for the nucleotides 1 to 6 of their 3' part. 

5 

One another subject of the present invention is a set of VH forward primers 
capable of specifically hybridizing in stringent conditions with the nucleic acids 
encoding the variable segments (VH) of immunoglobulin heavy chains, said 
variable segments being distributed among at least 8 VH subgroups, associated 

10 with a CII reverse primer, or a mixture thereof, capable of specifically 

hybridizing in stringent conditions with the nucleic acid encoding the constant 
segment (CH) of a given type of an immunoglobulin heavy chain, characterized 
in that the set of VH forward primers comprises at least the 8 following 
subgroups of VH primers corresponding to the VH subgroups : 

15 - the VH1 primers having the sequences SEQ ID N° 1 to SEQ ID N° 3> and 

- the VH2 primer having the sequence SEQ ID N° 4, and 

- the VH3a primers having the sequences SEQ ID N° 5 and SEQ ID N° 6, and 

- the VH3b primers having the sequences SEQ ID N° 7 to SEQ ID N° 10, and 

- the VH4 primers having the sequences SEQ ID N° 11 and SEQ ID 12 ? 

20 and 

- the VH5 primer having the sequence SEQ ID N° 13, and 

- the VH6 primer having the sequence SEQ ID N Q 14, and 

- the VI17 primer having the sequence SEQ ID N° 1 5. 

25 Preferably, the set of VH forward primers according to the present invention is 

characterized in that the sequences SEQ ID N° 1 to SEQ ID N° 15 may contain 
at least one to three point mutations, except for the nucleotides 1 to 6 of their 3' 
part. 



30 



More preferably, the set of VH forward primers according to the present 
invention is characterized in that the CH reverse primer is selected from the CH 
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reverse primers capable of specifically hybridizing in stringent conditions with 
the nucleic acids encoding the constant segments (CH) oTthc IgM heavy chain, 
the IgC heavy chain and the IgA heavy chain. 

Preferably, the set of VH forward primers according to the present invention is 
characterized in that, when the CH reverse primer is capable of specifically 
hybridizing in stringent conditions with the nucleic acid encoding the constant 
segment (CH) of the IgM heavy chain, the CH reverse primer has the sequence 
SEQ ID N° 26, or the sequence SEQ ID N° 26 wherein one to three point 
mutations may occur, except for the nucleotides 1 to 6 of its 3* part. 

In another preferred embodiment, the set of VIT forward primers according to 
the present invention is characterized in that, when the CH reverse primer is 
capable of specifically hybridizing in stringent conditions with the nucleic acid 
15 encoding the constant segment (CH) of the IgE heavy chain, the CH reverse 

primer has the sequence SEQ ID N° 33, or the sequence SEQ ID N° 33 wherein 
one to three point mutations may occur, except for the nucleotides 1 to 6 of its 
3' part. 

In another preferred embodiment, the set of VH forward primers according to 
the present invention is characterized in that, when the given type of 
immunoglobulin heavy chain is the lgG type, a mixture of two Cll reverse 
primers is associated with the set of VH forward primers, 
said two CH reverse primers having the sequences SEQ ID N° 27 and SEQ ID 
N° 28, or the sequences SEQ ID N° 27 and SEQ ID N° 28 wherein one to three 
point mutations may occur, except for the nucleotides 1 to 6 of its 3' part. 

Preferably, the set of VH forward primers according to the present invention 
according to the present invention may further be characterized in that the CH 
30 reverse primer is selected from the CH reverse primers capable of specifically 

hybridizing in stringent conditions with the nucleic acids encoding the constant 



5 



10 
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segments (CH) of the TgGl heavy chain, the IgG2 heavy chain, the IgG3 heavy 
chain or the IgG4 heavy chain. 

Another subject of the present invention is a method for the in vitro diagnosis of 
5 a condition associated with an abnormal expression of the repertoire of a given 

type of an immunoglobulin heavy chain by a lymphocyte B population in a 
subject, characterized in that it comprises the following steps: 

(1) determining the quantitative and qualitative profile of the given type of 
immunoglobulin heavy chain from a tissue sample of said siibject according to 

] 0 the present invention, 

(2) comparing the quantitative and qualitative profile obtained at the step (1) 
with a conirol quantitative and qualitative profile of said given type of 
immunoglobulin heavy chain, the demonstration of a significant modification of 
the profile obtained at the step (1) being significant of such a condition in the 

15 subject. 

Preferably, the method for the in vitro diagnosis according to the present 
invention is characterized in that the condition is an auto-immunc disease, a B 
cell lymphoma or an immimodepressive disease. 

20 

In another preferred embodiment, the method for the in vitro diagnosis 
according to the present invention is characterized in that the condition results 
from a bone marrow transplantation, from a vaccinal test or from an allergic 
reaction. 

25 

Another subject of the present invention is a method for the in vitro follow-up of 
a treatment of a condition associated with an abnormal expression of the 
repertoire of a given type of an immunoglobulin heavy chain by a lymphocyte B 
population in a subject, characterized in that it comprises the following steps: 
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(1) optionally, determining before the treatment the quantitative and 
qualitative profile of the given type of immunoglobulin heavy chain from a 
tissue sample of said subject according to the present invention, 

(2) determining, during the treatment, the quantitative and qualitative profile 
5 of the given type of immunoglobulin heavy chain at given times from tissue 

samples of said subject according to the present invention, 

(3) comparing the quantitative and qualitative profiles obtained at the step 
(2) and optionally at the step (1) with each others and optionally with a control 
quantitative and qualitative profile of the given type of immunoglobulin heavy 

10 chain, the demonstration of a significant modification of the profile obtained at 

the step ( 1 ) being significant of such a condition in the subject. 

Preferably, the method for the in vitro follow-up according to the present 
invention is characterized in that the condition is an auto-immune disease, a B 
15 cell lymphoma or an immunodepressive disease. 

In another preferred embodiment, the method tor the in vitro follow-up 
according to the present invention is characterized in that the condition results 
from a bone marrow transplantation, from a vaccinal test or from an allergic 
20 reaction. 

Another subject of the present invention is a kit for determining the quantitative 
and qualitative profile of the repertoire a given type of an immunoglobulin 
heavy chain expressed by a lymphocyte B population present in a tissue sample, 
25 characterized in that it comprises the set of VH forward primers according to the 

present invention associated with the CH reverse primer. 

Preferably, the kit according to the present invention is characterized in that it 
further comprises a set of JH reverse primers, optionally labeled, corresponding 
30 to the JII subgroups and capable of specifically hybridizing in stringent 
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conditions with the nucleic acids encoding the junction segments of the given 
type of immunoglobulin heavy chain. 

More preferably, the kit according to the present invention is characterized in 
5 that the set of JH reverse primers comprises the 6 following subgroups of JH 

primers corresponding to the JH subgroups : 

- the JHi primer having the sequence SEQ ID N° 16, and 
the JH2 primer having the sequence SEQ ID N° 1 l s and 

- the JH3 primer having the sequence SEQ ID N° 1 8, and 

1 0 - the JH4 primers having the sequences SEQ ID N° 1 9 to SEQ ID N° 21, and 

- the JH5 primer having the sequence SEQ ID N° 22, and 

- the JH6 primers having the sequences SEQ ID N° 23 to SEQ ID N° 25. 



More preferably, the kit according to the present invention is characterized in 
15 that the sequences SEQ ID N° 16 to SEQ ID N* 25 may contain at least one to 

three point mutations, except for the nucleotides 1 to 6 of their 3' part. 



Kits comprising primers according to the present invention are well described in 
the literature and may further comprise suitable reagents. 

20 

Another subject of the present invention is the use of the kit according to the 
present invention, for the in vitro diagnosis of a condition associated with an 
abnonmal expression of the repertoire of a given type of an immunoglobulin 
heavy chain by a lymphocyte B population in a subject 

25 

Preferably, the use of the kit according to the present invention is characterized 
in that the condition is an auto-immunc disease, a B cell lymphoma or an 
immunodepressive disease. 
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In another preferred embodiment, the use of Ihe kit according to the present 
invention is characterized in that the condition results from a bone marrow 
transplantation, from a vaccinal test or from an allergic reaction. 

5 

Ilie purpose of the legends of the figures and of the examples below is lo 
illustrate the invention. It doesn't limit the scope of the claimed invention. 



1 0 LEGENDS OF THE FIGURES 

Figure 1: Immunoscopcs profiles from two healthy donors. 

In Figure 1A, purified B cells from two healthy donors, donors 789 and 743 
15 were subjected to VH families specific PGR amplification as detailed in 

Materiel and Methods using VII specific primers (see Table 2) and C\i primer, 
followed by a 4< run-oflT reaction with a C^-Fam probe. For donor 743, 
Immunoscope profiles were obtained from two separate samples, sample W and 
Z, harboring ihe same initial number of B cells. In Figure IB, Immunoscope 
20 profiles were realized for two given rearrangements, VH5-JI11 and VH5-JH2. 

For both 789 and 743 donors, VH5-JH1 rearrangement have been chosen as 
prototype rearrangement for diversity determination, and subjected to 
exhaustive sequencing. In addition, purified CDR3 band from VH5-JH2 
rearrangement of donor 789 (shown by arrow) was also used for the same 
25 purpose. This purified band used have a CDR3 length of 8 aminoacids. 

Figure 2: Two different examples of IgM clonal expansions 

Clonal expansion A has been found in the course of VH5-JH2 exhaustive 
sequencing from donor 749. All the sequences from clonal expansion A have 
30 the same CDR3 of 14 aminoacids with the sequence S£Q ID N°38. Clonal 

expansion B have been found in the course of VH5-JH1 exhaustive sequencing 
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10 



from donor 749. All the sequences from clonal expansion B have Ihc same 
CDR3 of 25 aminoacids with the sequence SEQ ID N°39. The sequences differs 
in the mutations occuring in the VH5 region. Codons where mutations occur are 
initiated in red according to standart nomenclature.* or ** means differents 
mutations occuring on the same codon. Numbers of each clone for a given 
pattern of mutations are indicated in black. All mutations occurs from the 
germline clones (GL). Dashed clones are not found virtual intermediate clones. 
All existing clones are denominated by a letter. 



Figure 3: Properties and distribution of mutated versus germlime 
sequences 

V115-JH1 exhaustive sequencing from donor 743 was realized separately in two 
samples, W and Z containing the same number of B cells. (See Table 4). Figure 

15 3 A represent the distribution of the sequences between the two samples. tW, tZ 

and tOv stand for total sequences of W 7 Z and W-Z Overlap respectively, d W, 
dZ and dOv means the different sequences in W, Z and W-Z Overlap 
respectively. Figure 3B shows the distribution of germline versus mutated 
sequences in relation with the number of sequences found for each individual 

20 clone. 

EXAMPLES 



Example 1: Experimental procedures 

25 

1 . Cell preparations, antibodies and flow cytometry. 

Blood (approximatively 500ml) from three healthy donors was obtained from 
the Etablissement Fran9ais du Sang, Necker Enfants Malades Lecourbe, Paris, 
France. Donor 789 is a 56 year old woman, donor 743 is a 55 year old woman, 
30 donor 522 is a 30 year old man. PBMCs were isolated by centiifugation over 
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FicoJl-Paquc (Amcrsham Biosciences AB, Uppsalla, Sweden) in UNI-SEP ma xi+ 
tubes (Novamed, Jerusalem, Israel). 

B cells from donor 789 were then obtained from the PBMCs by depletion using 
the B cell negative isolation kit from Dynal (Dynal, Oslo , Norway). For donors 
5 743 and 522, PBMCs were co-stained with CD19 beads and CD19-PG 

(Pharmingen, San Jose, CA) and purified by positive selection on AutoMACS 
using respectively Possel or Possel-S programs (Mikenyi Biotcc, Bcrgisch- 
Gladbach, Germany). The eluted cells from the positive and negative fractions, 
as well as the total PBMCs were then labeled with either anti-IgM, anti-IgG, 

10 anlt-IgD, Ig-E, anti-lgAl/IgA2, anti-kappa and anti-lambda antibodies 

(Pharmingen, San Jose, CA) and rabbit polyclonal anti-Human IgM (Jackson 
Immunorcsearch laboratories, West Grove, Pennsylvania). 
Using these different antibodies, the percentage and the absolute number of cells 
bearing the different antibody iso types were determined among the PBMC and 

15 B cells positive fraction of donor 743 and 522 (Table 1). The purity of the 

positive fraction was checked either by the CD 19 staining, cither by the sum of 
anti-kappa and anti-lambda staining. 

2. RNA and cDNA preparation 

20 Cells from both positive and negative fractions were aliquoted and kept frozen at 

-20° in the lysis buffer of the RNeasy mini-kit (Qiagcn, Conrtaboeuf, France). 
Total RNA was extracted using the RNeasy mini-kit (Qiagen, Courtaboeuf, 
France) according to the manufacturers specifications, as previously described 
(Lim and al. (2002). J Immunol Methods 261, 177-94). cDNA was then 

25 prepared by RNA reverse-transcription with 0,5[ig/|i] oligo (dT)17 and 200 U of 

Superscript IT reverse transcriptase (Invitrogen, Cergy Pontoise, France). 

3. Quantitative Immunosc pe 

Quantitative Tmmunoscope analysis was performed as described (Lim and al. 
30 (2002). The different VH germline genes can be clustered in seven families 

according to their level of homology. Both 1MGT (http://imgt.cines.fr) (Lefranc 
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and al. (1999). Nucleic Acids Res 27, 209-12) and Vbase databases 
(hUp://www.mrc-cpe.cam_ac.uk) (Tomlinson and aL (1998). V Base Sequence 
Directory. In, M. C. f. P. Engineering, ed. (Cambridge, UK.)) were used to have 
access to their sequences and the germinal gene were aligned using GCG 
5 Wisconsin package program 

(http://www.accelrys.com/producis/gcg wisconsin_packagc). 
All the primers used are shown in Table 2. In order to have the opportunity, to 
study somatic mutations in the VH regions, VH family specific primers were 
chosen in the FR1 region for all the VH families except VH3 and VIM families 

10 where the specific primers were designed in Ihe FR3 region. VH3 gene family, 

the largest one, was divided in two sub-families VH3a and VH3b in order to 
achieve a better specifity. This specifity was tested by specificaly amplify each 
VH family from a PBMC mix of three hcathly donors using VH family specific 
primers and HIGCM primer, PGR products cloning and sequencing. For only 

15 one VH family, the VH7 family, full specifity of the amplification was not 

achieved as few of the amplified VH were belonging to the VIII family. 

For the quantification of the VH families usage for IgM expressing cells, an 
aliquot of the cDNA was amplified using pfu high fidelity Taq polymerase 
20 (Stratagene, Amsterdam, The Netherlands) with each of ihe 8 families VH 

specific primers on one side, the HIGCM primer on the other side and the TM- 
MGB-HCM probe, a Taqman Minor Groove Binder (MGB) FAM-labeled 
nested probe specific for the C\x region. 

Fot the quantification of the JH usage, an aliquot of the cDNA was amplified on 
25 the 5' side with the VTISint specific primer (SEQ ID N° 31) and each of the 6 

JIT specific primers on the 3* side together with the TM-MGB-VH5int probe 
(SEQ ID N° 32), a Taqman MGB FAM-labeled nested probe specific for the 
VI 15 family. Ail Taqman MGB probes were designed using the Primer express 
software program (Applied Biosystems. Comtaboeuf, France). PCR reactions 
30 were carried out in 25^1 total volume. For all these dilTerents reactions, real time 

quantitative PCR was then performed on an ABI 5700 system (Applied 
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DiosysLems, Courtahoeuf; France). The relative usage of each VII family or 
each JH segment was calculated according to the above formula: 

x=8 x=6 
U(lgVHy) = 2 2<c,oo-C<y)> U(lgJH y) = JL 2<o»*iW> 

In which C t (x) is the fluorescent threshold cycle number mesured for Vll(y) 
5 family or JH(y)> In this case, the VH family primers pair display a mean 

efficiency of 0.95 ± 0.08 and the JH segment primers display a mean efficiency 
of 0.91 -hO-08. 

For the VH family amplifications, 2\il of each amplification reactions were used 
as template in a run-off reaction initiated by either the HCM-FAM primer, a 

10 fluorescent nested C|i primer, the JH1-FAM: 5'-Fam- 

CCCTGGCCCCAGTGCTG-3 , (SEQ ID NT 16) or JH2-1-AM 5Tam- 
CCACGGCCCCAGAOATCGO' (SEQ ID N° 17) primers or the VH5-FAM 
primer in a total volume of 10|.il as previously described (Pannetier and al. 
(1997)- Tn The Antigen T Cell receptor: Selected Protocols and Applications., J. 

15 R. E- Oksenbcrg, ed. (Landes Bioscience, Chapman & Hall), pp. 287). All 

fluorescent fragments were then separated on a denaturing 6% acrylamide gel, 
run on an automated 373 DNA sequencer (Applied Biosystems, Couitaboeuf, 
France) and analyzed with immunoscope software (Pannetier and al. (1993). 
Proc Natl Acad Sci U S A 90, 43 19-23). 

20 

4. CDR3 band purification 

In the case of VII5-JII2 rearrangement amplification, bands corresponding to a 
CDR3 length of 8 amino-acids bad to be purified from the acrylamide gel. For 
this purpose, the PCR products were made visible on the acrylamide gel by a 
25 DNA silver staining system as previously described (Promega, Madison, Wi) 

(Casrouge and al. (2000). J Immunol 164, 5782-7: Raaphorst and al. (1996). 
Biolechniques 20, 78-82, 84, 86-7). The hands of interest were cutted from the 
gel and disrupted in 50jjI of TE. A second 33 cycles PCR using the VH5inl and 
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the JH2 primers was then conducted with l|xl of the band purification recovered 
PCR product and the amplification was then cloned as follow. 

5. VH transcript sequencing and analysis 

5 Either the total product of the VH5-JH1 rearrangement from different donors or 

the two PCR bands purified products amplified from the VTT5-JTT2 (8aa) CDR3 
bands) were cloned using the TOPO Blunt PCR cloning kit (Invitrogen, Cergy 
Pontoise, France). Sequencing reactions were performed as previously described 
(Arstila and a). (1999). Science 286, 958-61) directly on these products using 

10 VHSint primer and the ABI PRISM Big Dye Terminator Cycle Sequencing 

Ready Reaction Kit (Applied Biosystems, Courtaboeuf, France). These 
sequencing reactions were then run on an ABI PRISM 3700 DNA analyser 
(Applied Biosystems, Courtaboeuf, France). CDR3 regions and VH mutations 
of the corresponding sequences were extracted and analysed using Taps 1.1 

15 software written by Emmanuel lieaudoing (Casrouge and al. (2000). J Immunol 

164, 5782-7). 

6. Calculation of the VH/Bcells repertoire size 

The size of the VH repertoire can be estimated to equal the number of distinct 
20 sequences found for a given rearrangement (when sequencing is undergone to 

saturation) divided by the product of the considered VH frequency with the 
considered JH frequency. If the size of the repertoire was deduced from a 
purified CDR3 band, the percentage of this band among the all rearangement 
was taken in account in the above calculations. Diversity due to CDR3 
25 variability can be distinguished from the diversity due to somatic mutations 

according to the way the sequences are analysed on the Taps software. The 
calculations can be done either with the number of distincts sequences given by 
limits flanking the CDR3 regions, or including most of the VH region. 
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Example 2: Quantification of the VH and JH gene segment usage in human 
PBMCs 

In order to estimate the size of the peripheral human B cell repertoire, we have 
5 adapted the Immmunoscope method developped in our laboratory and 

previously applied for T cell repertoire studies (Pannetier and al. (1993). Proc 
Natl Acad Sci U S A 90, 4319-23), In addition, a precise quantification of VH 
and JH gene usage was performed by coupling real time PGR with 
Immunoscope analyses. B cells from several donors were subjected to those 

10 studies. Table 1 shows representative Facs analysis of two donors, before and 

after B cells enrishment, cDNA prepared from those cells of two donors were 
then submited to a step of PGR amplification using VH family specific primers 
on one side and one C\l primer, specific for the constant region of IgM on the 
other side, together with the TM-MGB-HCM fluorescent probe, nested in the 

15 constant region close to the Cj.i primer (see Material & Methods and Table 2). 

Real time determination of the level of fluorescence at each PCR step makes 
possible the quantification of VH family usage as shown in Table 3a. For a 
given donor, very large difference of VH family usage could be evidenced, VJ 13 
and VH4 gene families representing the majority of all rearrangements (60% 

20 and 20% respectively). This is compatible with a utilisation of VH gene 

proportional to the complexity of each family, since 27 different genes belong to 
the VH3 gene family and 10 to the VH4 family. Those results are aJso in 
accordance with what have been already determined by single cell PCR (54% of 
VH3 rearrangement and 23% of VH4 rearrangement out of 491 analysed cells) 

25 (Odendahl and al, (2000). J Immunol 165, 5970-9) ullhought they arc derived 

from a much larger cell number (about 70 000 cells). In order to evaluate the 
reproducibility of the quantification, purified IgM positive B cells from donor 
743 were divided into 6 samples (2.7 10 6 IgM positive cells/sample, sec fable 
3A). Two of these samples, sample W and sample Z were then tested for VII 

30 family usage quantification. As can be seen from Table 3a, VH usage of 

samples W and 7 are almost identical, as expected. In addition, the values 
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obtained for both samples are very similar lo those obtained for donor 789, 
although individual variations could be detected, for example in the VH2 family 
gene usage. 

5 The inventors then quantified of the JH usage for one given VH family. VH5 

family was chosen for two main reasons. 1°) according to 1MGT , the number of 
individual genes is limited to only two gcrmlinc encoded genes, VII5 and 
VH5.51. 2°) data on VII gene family usage (Tab1e3A) have shown that VH5 is 
nut overused (1% of all rearrangement in sample W and Z from donor 743 and 

10 3.2% for donor 789). For the quantification of JH gene utilisation, VHS-Cji 

amplification PGR products from the two dilTerem donors were subjected to a 
second PGR amplification using VH5int primer specific for the VH5 genes on 
one side and each of the 6 JH specific primers on the other side, together with 
the TM-MGB-VH5int fluorescent probe, nested in the vicinity of the VII5int 

15 primer and specific for the VH5 genes (see Material & Methods and Table 2). 

As shown in Table 3B, overutilisation of JII4 was observed for both donors. 
Again, separate quantification of samples W and Z from donor 743 gave almost 
identical results. Nevertheless, overall JII usage was less similar when donors 
789 and 743 were compared ('fable 3B). For the rest of thes study, the inventors 

20 have focused two gene segments, JHl and JH2, because they are the less used in 

all rearrangements of both donors 789 and 743. 

Example 3: Immunoscope profiles of human peripheral blood lgM positive 
B lymphocytes 

25 

Following quantification of VH and JH usage, PGR amplification was submitted 
to a <( run-off » reaction, as previously described (Panne tier and al. (1993). Proc 
Natl Acad Sci USA .90, 4319-23]. FigurelA displays for each VH family the 
different immunoscope profils obtained from both 789 and 743 donors. 
30 Compared to T cells [Arstila and al. (1999). Science 286, 958-61J, 13 cell 

profiles have two mams characteristics : 1°) the number of peaks representing a 
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given size of CDR3 is larger, 2°) the mean size of these peaks is close to 15 
amino-acids CDR3 while, for T cells, it is close to 10 amino-acids CDR3. Most 
patterns display a Gaussian repartition of the CDR3 length, in agrement with 
previously reported profiles of mouse splenic B cells [Delassus and al. (1995). J 
5 Immunol Methods 184, 219-29J although some peitubations can he observed for 

some of the smallest VH families (VH2 and VH6 families for donor 789). 
Figure IB shows Immunoscope profiles specific for the VH5-JH2 
rearrangement in donor 789 and VH5-JH1 rearrangement in the two donors 789 
and 743. Trie Irruiuuioscope software allow to calculate the aera beneath each 

10 peak which is directly proportional to the number of sequences included in the 

peak [Panncticr and al. (1993). Proc Natl Acad Sci USA 90, 4319-23], Thus, 
for the VH5-JH2 rearrangement in donor 789, the peak corresponding to a 
CDR3 length of 8 aminoacids represents 1.84% of all the VH5-JH2 
rearrangement. Similarcly the peaks corresponding to different CDR3 length of 

15 the VH5-JH2 rearrangement for the donor 789 (data not shown) were quantified 

(data not shown). Those calculations will be performed to estimate the size of B 
cell repertoire (see below). 



20 Example 4: Estimation of the size of peripheral blood lgM+ and total 

E3 lymphocytes repertoires 

To estimate the global size of the B cell repertoire, the inventors first focused on 
the VH5-JH2 rearrangement in donor 789 for the following reasons. First, VI 15 

25 family only includes two germline genes and its overall usage in the constitution 

of the antibody repertoire is not more then 3.2% (see Table 3). Second, JH2 
segment is the second less utilized gene segment among all JH fragments (2,1% 
of all VH5 rearrangements) ( Table 3). Therefore, it can be hypothesised that, 
after Immunoscope separation and quantification of all CDR3 peaks, the 

30 exhaustive sequencing of those rearrangements will be feasible. Third, die 

Immunoscope profile of VH5-JTI2 rearrangement in donor 789 is representative 
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of most blood D ceil rearrangements., characterised by a quasi-Gaussian 
repartition only slightly perturbed by the presence of few clonal expansions 
(Figure IB). Finally, the utilisation of VH5-JIT2 rearrangement to base B cells 
repertoire size calculations is possible due to the random utilisation of the 
5 variable genes in the adull. 

The first objective was to study the general size of the B cell repertoire 
irrespectively of yhe isoiyp expressed by the antibody produced. To achieve this 
goal, cDNA from donor 789 was PCR amplified using VH5 gene family- 
specific primer together with IGJH2 specific primer (see Table 2). « Run oIT» 

10 reaction was then performed with a JH2-Fam fluorescent probe and the specific 

Immunoscope profile obtained (Figure IB). Since this profile includes VTI5-JH2 
rearrangements for all antibody isolypes, it can be considered to be 
representative of the total size of the repertoire. Indeed, the total number of 
individual sequences included in this profile is beyond attempt of entire VH5- 

15 JH2 rearrangement exhaustive sequencing. Consequently, as previously 

described (Casnouge and al. (2000). J Immunol 164, 5782-7), the band 
corresponding to a CDR3 length of 8 amino acids was cut from the 
polyacrylamidc gel and subjected to a second PCJR using the VH5tnt and TGJH2 
primers (Tabic 2). Because the diversity found in the CDR3 of this band is 

20 proportional to the peak aera deduced from the Immunoscope profile, the size of 

total repertoire can be deduced from exhaustive sequencing of the band. This 
sequencing is now made possible since only a known fraction of all the VII5- 
JH2 rearrangement will be considerered. Exhaustive sequencing of a given 
purified CDR3 band or a total rearrangement is considered to be achieved when 

25 the total number of different sequences versus the total number of clones 

sequenced reach a plateau (Arstila and al. (1999). Science 286, 958-6 l)Table 4 
show that the plateau in the number of different sequences is reached for 72 
different sequences for the 8aa band. Aglobal CDR3 diversity of 5.8 10 6 can be 
calculated from the Baa band dividing the number of sequences by the product 

30 of the percentage of usage of VMS, the percentage of usage of JH2 and the 
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percentage of the 8aa band among VH5-JH2 rearrangement. Similar results are 
obtained when other CDR3 bands arc purified (data not shown). 

As IgM positive B cells are by far the most frequent subset among PBMC 
(Table 1), the inventors then focused on the determination of repertoire size of 
these cells. For this purpose, the inventors decided to concentrate on VH5-JHI 
rearrangement because on one hand, Jill gene segment is the less utilized 
among all JH fragments (0.2%) (Table 3) and on the other hand exhaustive 
sequencing was possible on the whole VH5-J111 rearrangement. In this respect, 
cDNA from donor 789 was PCR amplified using the VH gene family and the 
Cn specific primers, followed by "run-off reaction using a VH5-Fam 
fluorescent probe (Table 2). 'fhe specific VH5-JH1 immunoscope profile is 
shown in Figure IB. The inventors succeeded in the exhaustive sequencing of 
VJT5-JH1 rearrangement of donor 789, although a total number of one thousand 
sequences had to be performed to reach a plateau of 346 different sequences 
(Table 4). A global CDR3 diversity of 5.1 10 6 was evaluated for the IgM 
bearing B cells by dividing the number of different sequences by the product of 
the percentage of usage of VTI5 and the percentage of usage of JHl. This 
number is close to the size of the total repertoire* consistent with the prevalence 
of IgM positive B cells among PBMC. 

To confirm this finding and to valididate of this approach, the inventors 
performed global sequencing of the VH5-JH1 rearrangement amplified from 
donor 743, and this was done twice on the two samples W and Z, which harbor 
25 the same number of IgM positive B cells (2.7 10 6 i.e. 1/6 of the total amount of 

purified B cell). The specific VH5-JII1 immunoscope is shown in Figure IB for 
both samples. Exhaustive sequencing of 655 and 675 total sequences was 
performed for sample W and Z respectively, giving 232 and 228 different 
sequences respectively (Tabic 4). The size of the IgM repertoire was then 
30 calculated as described above, leading to a CDR3 diversity of 2.1 xlO 6 and 

2.4x1 0 6 in samples W and Z respectively. Conscqucntely, diversity calculation 
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for the 500 ml blood sample from donor 743 is equal to (W+Z)x3 = 1.3xl0 7 
(Table 4). 

Example 5: Contribution of somatic mutations to diversity and 
5 determination of the general clone size of peripheral B cells 

As compared with T cells, one of the most striking difference in generating 
diversity in B cells is the property to accumulate somatic mutations in thejr 
variable regions. Somatic mutations are supposed to be generated mainly in the 

10 germinal center, as the key phenomenon in affinity maturation and maintenance 

of memory (Wu and al. (2003). J Clin Immunol 23, 235-46). Somatic mutations 
ore not restricted to IgG expressing B cells since 35% of human blood IgM+B 
cells have been reported to display somatic mutations in heavy chain variable 
genes (Klein and al. (1997). Blood 89, 1288-98). The calculations described 

15 above lo define the size of the B cells repertoire take into consideration CDR3 

diversity* but not the diversity generated by somatic mutations. The inventors 
studied first these mutations that were detected in two clones during the 
exhaustive sequencing of the VH5-JH2 and VH5-JHI rearrangements. By 
definition, the inventors consider that belong to the same clone all the sequences 

20 bearing the same CUR3. Using this criteria, it is possible to draw phylogenic 

trees of the mutations. Clone A originates from the sequencing of VH5-JH2 
rearrangement from donor 789, it bears a CDR3 sequence of 14 aminoaeids, it 
expresses an IgM receptor and has been found 67 times. Clone B originates 
from the sequencing of VH5-JH1 rearrangement from donor 743 , it bears a 

25 CDR3 sequence of 25 aminoaeids, i expresses an IgM receptor and have been 

found in 130 sequences. Both clone A and B mutations trees are shown in 
Figure 2. These two clones display very different patterns of mutations. For 
clone A, only 3 sequences are found in gcrmline configuration and the vast 
majority of sub-clones are the ones that accumulate the highest numbers of 

30 mutations (sub-clones I and L). Most of those mutations are related and can 

be deduced from a linear tree. Out of 21 codons subjected to mutations, only 3 
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give rise to silent mutations. On the other hand, clone B displays 33 sequences 
without mutations and many sub-clones bearing non related mutations could be 
directly deduced from the germline sequence, leading to a "star" pattern of 
mutations. Only sub-clones bearing few number of mutations show a large 
5 accumulation (sub-clone N). Out of 35 codons subjected to mutations, 10 gave 

rise to silent mutations. 

If those two examples of clonal mutations can be informative in the follow-up of 
a given immune response, they are not significant cnought to evaluate the 

10 contribution of somatic mutations to the diversity. In order to calculate the 

global contribution of somatic mutations to diversity, the inventors have 
compared all the VH sequences obtained from exhaustive sequencing of the 
different rearrangements to the germline VT-1 irrespectively of the CDR3 
sequences. Results are shown in Table4. For both donors and for both VU5-JH2 

15 and VH5-JH1 rearrangements, approximately half of the VH sequences are 

found bearing one Or several somatic mutations. Taking into account this B cell 
specific additional source of diversity, the global repertoire size of the studied 
blood samples can be estimated (Table 4). For donor 789, the value obtained for 
500ml blood sample ranges from 6.7 x 10 6 to 10 7 depending of the 

20 rearrangement studied, the methodology used (with or without band 

purification), and the specificity of the PGR amplification (all isotypes or only 
IgM). Taking into account a degree of uncertainty inherent to the approach used, 
the VH repertoire of B cell repertoire is around one order of magnitude larger 
than the VB T cells repertoire (Arstila and al. (1999). Science 286, 958-6 J). 

25 

This result has other implications in what concern the properties of the D cells 
repertoire. The estimated size of the B cell repertoire, calculated from a given 
number of B cells, is always very close to this number and this was even more 
manifest when only a small number of B cells was studied, as in W and Z 
30 samples from donor 743. In this case, the repertoire number obtained was even 

slightly larger than the number of B cells, but still included the confidence limits 
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interval (sec Table 3). In other words, in a 500 ml blood poach, according to the 
diversity of their VH genes repertoire, each ft cells express and produce a 
different antibody, with the exception of antigen speciiic clonal expansions. 
Consequently, assuming that the total amount of blood in one given individual is 
5 close to 5 liters, the global S cells clone size must be comprised between 10 and 

1 and therefore the diversity of B cell repertoire mast range accordingly between 
I0 7 and 10*. Again, this result strengthens the differences existing between. T 
cells and B cells repertoire as the T cell VB clone size has been reported to be 
approximative^ equal to 100 (Arstila and ol. (1999). Science 286, 958-61). 

10 

Example 6: Determining the proportion of clonal expansions within one 
given VJXJ rearrangement 



Another property of 13 cells is the capability of developing very large clonal 

15 expansions in response to antigenic stimulation. Clonal expansion can be 

defined according to different criteria : 1 °), it corresponds to all sequences 
sharing the same CDR3 sequence, irrespective on the presence or not of somatic 
mutations in their V regions. The inventors have used this definition to represent 
trees of mutations for a given clone (see Figure 2). In this case, all sequences 

20 sharing complete identity toward all VH region in addition to the CDR3 could 

be generated by exhaustive sequencing. 2°) it correspond to all sequences 
sharing identical CDR3 sequences found for a given rearrangement. 3 V ) clonal 
expansions might also be taken in consideration above a certain threshold of 
representation. Tn order to address this issue, the inventors have performed 

25 exhaustive sequencing of the VH5-JH1 rearrangement from two samples 

originated from donor 743, containing the same number of purified B cells. As 
shown in Figure 3, three differents situations can be distinguished depending in 
the number of copies a given sequences can be found. If the number of copie per 
sequence is less dian 5, which represent around 70% of total sequences, there is 

30 only 5 % of overlaping sequences in between samples W and Z, constituted by 

3% of mutated sequences and 2% of non mutated sequences. Similarely, non 
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commun sequences between samples W and Z are almost equaly split in 
mutated and non mutated sequences (48% versus 47%) (Figure 3B). When the 
copie number per sequence ranged between 6 and 10, the non common gcrmline 
sequences present in W or Z still account for half of the sequence (52%) but 
5 shared W and Z sequences have increased Lo 24% (sec Figure 313). This 

intermediate situation, which will he representative of « small clones » 
represent 12% of the total W&Z sequences. Finally, a population of « large 
clones » with a copie number per sequences higher than 10 and representing 
18% of total sequences, are found. Tn this case, the majority of sequences (73%) 

10 are common W and Z sequences and most of then habor mutations (64%). 

Therefore, depending if small clones are taken in consideration or noL, the 
proportion for clonal expansions ranged between 18%> and 30% of all VH5-JH1 
rearrangement sequences from donor 743- A similar proportion of clonal 
expansions were found from VH5-JH1 and VH5-.IH2 rearrangements from 

1 5 donor 749 (Data not shown). As already mentioned above, if clonal expansions 

arc not taken in consideration, 95% of the sequences are found in sample W or 
Sample Z but not in both. This finding strengthen evidences for a small global 
clone size. 

20 DISCUSSION 

1 Q ) The diversity of the Immunoglobulin VH genes repertoire is comprised 
between 10 7 and 1 0 s different expressed rearrangements 

In this study, the inventors report the first size estimate of the human B cells VH 
25 gene repertoire. To reach this goal, the inventors first quantified by real time 

PCR VH gene utilisation in total peripheral B cells isolated from normal donors. 
The inventors then tbcus on the VH5 gene family which display an intermediate 
rale of utilisation and is composed by only few germline genes. JH gene 
segment utilisation was then quantified in all VH5 gene family rearrangements. 
30 Two rearrangements, VH5-JH1 and VII5-JII2, were chosen and extensively 

studied. In the case of VH5-JH1 rearrangement, lmmunoscope was performed in 
order to separate and quantify the representation of 8 and 9 amminoacids long 
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CDR3. The corresponding bands were then purified, cloned and subjected lo 
exhaustive sequencing. Calculations have allowed to estimate the size of the VH 
gene repertoire. For the VH5-JH2 rearrangement, exhaustive sequencing was 
performed on the total PGR product, for all the different CDR3 lengths. In both 
5 case, the estimate of the VH gene global diversity is close to 10 7 for all B cells 

purified from 500ml of human blood. If the total amount of blood of a nonnal 
donor is roughly equal to 5 liters, the global diversity of the human peripheral B 
cell VH gene repertoire must be close tolO 8 expressed rearrangements 

10 2°) The mean global clone size of peripheral B cells is close Co 1 

From the above result on global diversity of the human peripheral B celL it can 
be estimated that the globed clone size of immunoglobulin expressing cells must 
varied between 1 and 10. As a matter of fact, when separated sequencing is 
performed on VH5-JH2 rearrangement from two samples of the same donor 

15 harboring the same number of cells, the size of the repertoire calculated in both 

case is almost identical to this number of cells. The same result was obtained 
with different number of cell in the studied sample* even when the number of 
cells is equal to the B cells total number of the blood gill as for the sequencing 
of VH5-JH1 rearrangement .In addition, a detaillcd study of the sequences of 

20 each or this two samples show that, with the exception of large clonal 

expansions with mutated VH genes, not more than 5% of the sequences are 
found in both samples. Therefore, if B cell diversity from a blood gift can be 
estimated to reach 10 7 different VH and if the mean global clone size for B cell 
is close to 1, the total VH diversity must reach 10*. Alternatively, if B cell mean 

25 global clone sze is closer to 10, the overall size of the VH repertoire of the 

donor should be similar to that of the blood gift, ie 1 0 7 . 

3°) VH-VL pairing is poorly involved the generation of B cells diversity 

If the calculation of the diversty of VIT repertoire always give a number similar 

to that of the number of cells studied, one cau assume that VH genes diversity 
30 can be similar to B cells diversity. On the contrary than for the T cells 

repertoire, VL repertoire and VII-VL pairing are less involved in the diversity of 



15/12 2003 LUN 19:20 FAX 01 44 29 35 99 CABINET REGIMBEAU 



©042/078 



41 



the B cells repertoire than Voc repertoire and Va-Vp paring in the diversity of T 
cells repertoire. If each B cells in the periphery which cannot be considered lo 
belong to a clonal expansion (and therefore representing at least 70% of all B 
cells) is already identify simply according Lo its VH usage, size determination of 
5 the VL repertoire can be assumed to provide similar values than for VH 

calculations. In other words, if VH rearrangement is enought by itsef to define a 
given B cell, each B cell expressing a particular VH-VL pair is necessarily 
single. This situation is completely different to what have been already reported 
for the T cells repertoire, where each Vfl must pair in average with 25 different 
10 a chain leading to a total c*P TCR diversity in the blood not less than 25x1 0 6 

diffcrcnts TCR. Therefore, if both T and B cells repertoire reach 
approximatively the same size of repertoire (between 10 7 and 10 8 different 
expressed rearrangements), the way by which this level of diversity is achieved 
strongly differ for each cell type. In B cells, the size of the repertoire mainly rely 

1 5 on VH CDR3 diversity and on VH somatic mutations. Consecutively, the size of 

the VH repertoire is at least one order of magnitude larger as compared to the 
Vp repertoire. On the contrary, in addition to Vp and Va diversity, T cells 
repertoire diversity result also in the different possibilities of pairing between 
V<?Land Vp chains. The importance of Va-Vp pairing in the T cells repertoire 

20 diversity have been postulated to result in part from several rounds of cell 

divisions between Vp and Va rearrangements, allowing the same Vp to pair 
with difierents Va .This rise the possibility that VL rearrangements could occur 
in B cells shortly after VH rearrangements within a window of time allowing 
less cell divisions than for T cells, preventing a 13 cell clone expressing a given 

25 VH to expand as much as a T cell clone. Evidences showing that, in the mouse, 

VL rearrangements can even occur prior to VH rearrangements are also in line 
with this hypothesis. The importance of Va-Vp pairing in the T cell repertoire 
diversity can also derived from thymocyte positive and negative selection in the 
thymus. The result of this process lead to Lhc induction of apoptosis for a large 

30 majority of cells and selection of a small number according to the specificity of 

their TCR. Therefore, blood circulating T might be more representative of a Va- 
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VP pair selected population while blood circulating B cells, with the exception 
of clonal expansions, could be more closely related lo the bone marrow 
outcome. Tor the T cells, the selection nf the Vnt-Vp pair occur in the thymus 
on the basis of MHC recognition, and allow the local expansion of the cells 
5 expressing the selected TCR. Indeed, the mean clone size for T cell have been 

estimated to reach 100 copies in the periphery. For B cells, on the contrary, our 
results show a much smaller clone size, ranging between 1 and 10. Another 
hypothesis to explain this difference, would be to postulate that bone marrow 
produce B cells with fewer constraints of selection than for T cells in the 
10 thymus. Indeed, two major steps of selection of B cells have been reported to 

occur in the bone marrow. First, once a functional VDJ rearrangement is 
acheived and an |4 heavy chain expressed at the surface of the prc-B cell, failure 
of pairing with the VpreB lambdaS surrogate light chain could prevent further 
differentiation. Second, cells expressing a self reactive BCR have been 

15 described to modify their receptor specificity by a process denominated 

« receptor editing ». Nevertheless, the properties of BCR antigen recognition, 
leading to the ability to identify a three diinentional patern, differ strongly with 
the TCR capacity to recognize a given peptide in the context of a given MTTC. In 
addition, our results being consistant with no or few antigen independent cell 

20 divisions in the bone marrow, its outcome could be more representative of a 

random distribution of BCR than the thymus TCR outcome. Tndeed, being 
deduced from total peripheral B cells, our results do not allow the complete 
assimilation of this population with the bone marrow outcome. Furthermore, in 
the mouse, peripheral B cells have been described to be mainly ligand selected . 

25 In order to gain some insights on this issue, studies on purified B cell 

populations representative of the recent bone marrow emigranl should be 
informative . 

4°) Clonal expansions account ffoir 20 to 30% of a given rearrangement 
Over than 15 years ago, Langman and Colin have proposed, based on informatic 

30 stimulations and on B cells repertoire observations that B cell repertoire could 

he divided in several identical sub-fractions, each sub- fraction beinc a 
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« functional unit » of 10 7 R lymphocytes bearing a repertoire of 10 5 
rearrangements (Lungman and Cohn. 1987), Our findings thai the global R cell 
clone size must be be comprise between one and ten and that most peripheral B 
cells are unique are apparently incompatible with the protccton theory which 
5 postulate, in organims haboring more than 10 7 lymphocytes, a important 

redanduncy of BCR specificities. Paradoxicaly, T cells repertoire properties, as 
ihey were previously described (Arstila et Wagner) seems more compatible with 
a direct interpretation of the protecton theory. First, as already mentionned by 
Casrouge et aL the difference of T cells diversity between mice [Casrouge and 
10 ai. (2000). J Immunol I64 r 5782-7]and human [Arstila and al. (1999). Science 

286 \ 958-61 Jdoes not reflect the much larger difference in total number of 
peripheral T lymphocytes between these two species. Second, T cells global 
clone size appear, in particular in human, to be much larger than B cells one and 
therefore being compatible with the existence of several functional units. 

15 

Another properly of the B cells repertoire hardly seems compatible with our 
results. In both mice and human, B cells produce « natural antibodies » which 
are mainly low affinity IgM antibodies, usualy polyspecific and germ line 
encoded. Those antibodies, which are predominant in newborn life, often 
20 recognise self antigen. Their production is constant over life period and very 

stable, antibodies against given specificities always being found in dififerents 
individuals. How could this property of the B cells repertoire could be achieved 
if almost each peripheral B cell habor a unique specificity 

25 In order to reconcile our findings with both protccton hypothesis and production 

of natural antibodies, the inventors wishes to propound the following 
hypothesis. The maintenance of natural antibody production could be achieved, 
not by a continuous bone marrow outcome, but rather by continously activated 
peripheral B cells, self ligand selected and eventually giving rise to clonal 

30 expansions. Similarcly, protecton theory could only concern the pool of clonal 

expansions among all peripheral B cells. In this case, the stability of natural 
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antibodies production will be ensure mainly by peripheral proliferation of clonal 
expansions, which represent* according lo our results, up to 30% of the total 
number of sequences for one given rearrangement. As illustrated in the two 
examples of clonal expansions described in Fig 2, some sequences belonging to 
5 clonal expansions are in germline configuration (3 for clonal expansion A, 33 

for clonal expansion B), as expected for natural antibody producing clones . 
Some clonal expansions would also account for external antigen driven 
proliferations. The occurrence of mutations should therefore not only allow 
affinity maturation of the B cell repertoire but also help to diversify the natural 
10 antibody production. Indeed, in accordance with what have been determined by 

single cell the inventors found that up to 40% of IgM positive B cells in 
the blood have mutated VTL 



The results provide therefore, in addition to the first determination of the size of 
15 the peripheral B cells repertoire, some hypothesis for a better understanding of 

the mechanisms involved in the maintenance of this repertoire. The quantitative 
approach dcvelopped here could also be a powerfull tool to study B cells 
repertoire disturbances oecuring during pathological and autoimmune diseases. 

20 

Tabic 1: Proportion and absolute numbers of cells expressing different 
isotypc in the blood of two healthy donors 

Blood cells from donor 743 and 522 were stained before and after B cell 
enrishroent with the different antibodies according to the materiel and method 
25 and analysed on a FACScalibur (Becton Dickinson). For each staining, result 

are expressed in % of total cells and corresponding absolute numbers are shown 
in parenthesis. 
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Table 2: Specific primers for ii ceil heavy chain quantitative Immunoscopc 
analysis 

This table shows the list of primers utilized to quantified VH and Jll utilization 
5 by real time PGR. Each primers specilicaly amplify one or several VH, as 

notified. The primers localisation in ihe FR1 or FR3 is also shown. 3* end 
phosphorothioate chemical modification of some primers is mcntionned by a *. 
Only two primers display degenerate sites with R meaning A or G and S 
meaning G or C. 
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Table 3: Vh and J H quantification by real time FCR 

Table 3 A shows ihe mean percentage of V1T utilisation delemiincd by real lime 
PCR lor B cells from donor 789 and 743. JKor donor 743 , this determination was 
5 done separately on the two samples W and Z ? as displayed. Tabic 3D shows the 

JH segment usage in association with the VI 15 gene family. 
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